CDS

Accession Number TCMCG075C17674
gbkey CDS
Protein Id XP_007030256.2
Location complement(join(35361746..35362435,35362827..35362934,35363332..35363446,35363747..35365056))
Gene LOC18599976
GeneID 18599976
Organism Theobroma cacao

Protein

Length 740aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007030194.2
Definition PREDICTED: U11/U12 small nuclear ribonucleoprotein 48 kDa protein [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description U11 U12 small nuclear ribonucleoprotein 48 kDa
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K13156        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAATCCTTCTTCAATTCAACCCTCTCTCCCTTCCCAAAACCCTAACCCTAATTCCACAATCCCGTCCCTCCCCCAAAACTCGAACCTCAATGGCCCTTCCTCTCTCTCCACAACTCTCTCCTCCCTCACCGCCCTCCTATCTCTCTCCCACCAAACCCTCAATTCCCATTCTACCCTCACCAAATCCCTAAACCCCAACCTTATCCCTTGCCCTTTCAACCCCAATCATCTCCTCGCCCCAGAATCCCTCTTCTCCCATTCCCTCCGCTGCCCTTTCCCTCAAAATCTCGACCTTTACCCTCCAAATTACCGAAATACCCTCATACCCCCTTCAAATTTACATGCCCAGGACACCCATTTTCAAGGTATACAATGCTCTGAACTTTGCCTCTCTTTAGATGAATACTTTGCTGATTTTGGGTCAAATTTCTTTTGCAAAGATTGTCCTGCTGCTGTCAATTTGTTTGATATTGATAACTCCAAAAAAACGTTTACTTTACCGGGTTTTTTATCCGTTGAATGTGTTAATTTTGAGGGTTTTAATGAACGAGAGGGTGTGGTTAGTGAAGAAAAGGGTTTAAGGGTTTTAGCTTCAGGCTTATGGGAAATTAGGAGGGAAGTTGAGAGGTGGGGCGATTACCCTGGTTCGTATTCGTTTAATGTTATTTGTGCGATTTTGGGGTCGAAAATGGTGAAAGGAAGTAATTTGAGGAAATGGATTGTTGCTAATTCGCCACGGTATGGTGTTATGATTGATGGGTGTATGGGGGATCATATAGTTGTGCTGGTTAGGTTGTGTTTGAAAGCTGTTGTGAGGGAAGCGGTTGGTTTGATGGAAGTAGAGATGGGATATGGAGAAGCTAAGGAGAAAGAATGGGATGTGAATTTGCTGACGAGAATGTTTGAATGTCCTATTTTGCTTCAAGTTCTAGTGTGGTTAGGTTCTCAGCTTTCTGTTTTGTATGGTGATGTTAATGGGAAATTCTTTGCAATTAATATGATCAAGCAATGTGTATTAGAAGGTGCATCACTGTTGTTGTTGTTTCCATTGGAAGAAAAGGTGACTGATTCACACAATTTGGGACAAGAATCACAGAGTTTGGACGCTAATGGTGTTAAGGAAATTAAACTTGAGGAGACAATTGAGCAAAGTAATGAACCAGTTGAAACAGTCAATGAAACTATTGGTGTTGGGGTGATATTTGTATCCCAGGTAGCAGCAGCAGTTGCCGCATTGCATGAACGGTGCTTCCTTGAAGAAAAAATAAAGCATTTACGGGGTTTACAACAACTTTCAAGATATCAGCGGATGGCCGAGCATGCTTATGTCTCTGAGAGAGCTGATGCGGAGCGGAAGAAACGTCCCAACTATAGGCCTATAATTGACCATGATGGGCTTCCTCGGCAGGCGTCATCTAATGAGGAAACAAGTACGACTAAAACAAGAGAGGAGATATTGGCCGAAGAAAGAGACTATAAAAGACGAAGAATGTCATATCGTGGGAAGAAATTGAAGCGGACAGCATTACAGGTTATGAGGGATATAATAGAGGAATACACAGAGGAAATCAAGAAAGCTGGGAGGATTGGTTGCTTTGTCAAAGGAGTGGAAGAGGAAGGGTTGTTACCATCTGAATCACCAGTTCCTTATGACCGTGCTGTGGATGCTGATCAGCATAAGAAAGGTACCAGTGACATTTCTGAAGCAGCCAGACGTAGCCCAAACCATTGCAGGAGAAGATCACATGATGACCAGCATACTAGATCTACAAGATTAGAGGATTCCTCAAGAAATGGACACCATGATCTTCTTGAAGATTCAAGGAGTATGAGTAAAGAGAAACACAGAGACGAGTATCATTCTGGAATCTCAAAAAGATACAGAAGTCATGGGCGGTCAGATGAGCAAAGAAGTCATAGAAGGGAGCGAGATGATGCAGAATCCACTAGATCCACGCACTATGAGAGTGGAAGACGATCTAGTATTTCTAAATATAAGGATTACAAATCATCTTATTCTGCTTCTAATTCTTCAGATGACTTTCATGTAAGAAAGGATGACCAGAAGTTGGATGCTAGAGATAAGAATAGAAGGACTTCATATGAGAATCATACTCCTGGCTCCTGGGTGCAAAATGGATTTGATGATAGATATAATCCTTCAGAATCTGATGACATGTATGAAGATGATGTCTTTGTTAAGTATGTCAGACCAGAATGA
Protein:  
MNPSSIQPSLPSQNPNPNSTIPSLPQNSNLNGPSSLSTTLSSLTALLSLSHQTLNSHSTLTKSLNPNLIPCPFNPNHLLAPESLFSHSLRCPFPQNLDLYPPNYRNTLIPPSNLHAQDTHFQGIQCSELCLSLDEYFADFGSNFFCKDCPAAVNLFDIDNSKKTFTLPGFLSVECVNFEGFNEREGVVSEEKGLRVLASGLWEIRREVERWGDYPGSYSFNVICAILGSKMVKGSNLRKWIVANSPRYGVMIDGCMGDHIVVLVRLCLKAVVREAVGLMEVEMGYGEAKEKEWDVNLLTRMFECPILLQVLVWLGSQLSVLYGDVNGKFFAINMIKQCVLEGASLLLLFPLEEKVTDSHNLGQESQSLDANGVKEIKLEETIEQSNEPVETVNETIGVGVIFVSQVAAAVAALHERCFLEEKIKHLRGLQQLSRYQRMAEHAYVSERADAERKKRPNYRPIIDHDGLPRQASSNEETSTTKTREEILAEERDYKRRRMSYRGKKLKRTALQVMRDIIEEYTEEIKKAGRIGCFVKGVEEEGLLPSESPVPYDRAVDADQHKKGTSDISEAARRSPNHCRRRSHDDQHTRSTRLEDSSRNGHHDLLEDSRSMSKEKHRDEYHSGISKRYRSHGRSDEQRSHRRERDDAESTRSTHYESGRRSSISKYKDYKSSYSASNSSDDFHVRKDDQKLDARDKNRRTSYENHTPGSWVQNGFDDRYNPSESDDMYEDDVFVKYVRPE